Skill Chaining: Skill Discovery in Continuous Domains

نویسندگان

  • George Konidaris
  • Andrew Barto
چکیده

We introduce skill chaining, a skill discovery method for continuous domains. Skill chaining produces chains of skills leading to a salient event—where salience can be defined simply as an end-of-task reward, or as a more sophisticated heuristic (e.g., an intrinsically interesting event (Singh et al., 2004)). The goal of each skill in the chain is to reach a state where its successor skill can be executed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Skill Discovery in Continuous Reinforcement Learning Domains using Skill Chaining

We introduce skill chaining, a skill discovery method for reinforcement learning agents in continuous domains. Skill chaining produces chains of skills leading to an end-of-task reward. We demonstrate experimentally that skill chaining is able to create appropriate skills in a challenging continuous domain and that doing so results in performance gains.

متن کامل

Implementing Cst in Learning Layer of Csia for Higher Level of Intelligence

Development of cognitive architecture where the agents at different levels exhibit different levels of thinking. The paper primarily focus on building the skill tree at the learning layer of the architecture. These include the discovery of one’s own body, including its structure and dynamics. Also the acquisition of associated cognitive skills such as self and non-self-distinction. This can be ...

متن کامل

Learning Graph-Based Representations for Continuous Reinforcement Learning Domains

Graph-based domain representations have been used in discrete reinforcement learning domains as basis for, e.g., autonomous skill discovery and representation learning. These abilities are also highly relevant for learning in domains which have structured, continuous state spaces as they allow to decompose complex problems into simpler ones and reduce the burden of handengineering features. How...

متن کامل

Learning the Structure of Continuous Markov Decision Processes

There is growing interest in artificial, intelligent agents which can operate autonomously for an extended period of time in complex environments and fulfill a variety of different tasks. Such agents will face different problems during their lifetime which may not be foreseeable at the time of their deployment. Thus, the capacity for lifelong learning of new behaviors is an essential prerequisi...

متن کامل

Constructing Skill Trees for Reinforcement Learning Agents from Demonstration Trajectories

We introduce CST, an algorithm for constructing skill trees from demonstration trajectories in continuous reinforcement learning domains. CST uses a changepoint detection method to segment each trajectory into a skill chain by detecting a change of appropriate abstraction, or that a segment is too complex to model as a single skill. The skill chains from each trajectory are then merged to form ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009